SMR-Cmp: Square-Mean-Root Approach to Comparison of Monolingual Contrastive Corpora

Authors

  • Huarui Zhang
  • Chu-Ren Huang
  • Francesca Quattri
Abstract

The basic statistical tools used in computational and corpus linguistics to capture distributional information have changed little in the past 20 years, even though many standard tools have been shown to be inadequate. In this demo (SMR-Cmp), we adopt a new tool, Square-Mean-Root (SMR) similarity, which measures the evenness of distribution between contrastive corpora, to extract lexical variations. The result of one case study shows that the novel approach outperforms traditional statistical measures, including chi-square (χ²) and log-likelihood ratio (LLR).
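The abstract does not state the SMR formula, so the following is only a minimal sketch under a stated assumption: that the square-mean-root of two frequencies is the square of the mean of their square roots, and that similarity is this value divided by the arithmetic mean. Under that assumption the score is 1.0 for a perfectly even distribution and 0.5 when a word occurs in only one corpus. The function names and the per-million normalization are hypothetical. The log-likelihood ratio is included for contrast in its standard two-corpus form.

```python
from math import log, sqrt


def smr_similarity(freq_a: float, freq_b: float) -> float:
    """Evenness of one word's frequency across two corpora.

    ASSUMPTION (not taken from the abstract): similarity is the
    square-mean-root of the two frequencies divided by their arithmetic mean.
    Frequencies should be normalized (e.g. per million tokens) beforehand.
    """
    if freq_a < 0 or freq_b < 0:
        raise ValueError("frequencies must be non-negative")
    if freq_a == 0 and freq_b == 0:
        return 1.0  # convention chosen here: an absent word is trivially even
    square_mean_root = ((sqrt(freq_a) + sqrt(freq_b)) / 2) ** 2
    arithmetic_mean = (freq_a + freq_b) / 2
    return square_mean_root / arithmetic_mean


def log_likelihood_ratio(count_a: int, count_b: int, size_a: int, size_b: int) -> float:
    """Standard two-corpus log-likelihood (G2) for one word, for comparison."""
    total = count_a + count_b
    expected_a = size_a * total / (size_a + size_b)
    expected_b = size_b * total / (size_a + size_b)
    g2 = 0.0
    for observed, expected in ((count_a, expected_a), (count_b, expected_b)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * log(observed / expected)
    return 2 * g2


if __name__ == "__main__":
    # Hypothetical per-million frequencies of one word in two corpora.
    print(smr_similarity(400, 100))
    print(log_likelihood_ratio(400, 100, 1_000_000, 1_000_000))
```

In this sketch, words with low similarity are the candidates for lexical variation between the two corpora; unlike χ² or LLR, the score is bounded and does not grow with corpus size.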


Similar resources

Mechanical specialization of the obliquely striated circular mantle muscle fibres of the long-finned squid Doryteuthis pealeii.

The centrally located, mitochondria-poor (CMP) and superficially located, mitochondria-rich (SMR) circular muscle fibres in the mantles of some squids provide one of the few known examples of specialization in an obliquely striated muscle. Little is known of the mechanical properties or of the mechanisms and performance consequences of specialization in these fibres. We combined morphological a...

Translation and contrastive linguistic studies at the interface of English and Chinese: Significance and implications

Corpora have revolutionized nearly all areas of linguistic research over the past four decades (McEnery, Xiao and Tono 2006; McEnery and Hardie 2012). Translation studies and contrastive linguistics are no exceptions. Indeed, the rapid development of bilingual parallel corpora as well as monolingual and multilingual comparable corpora since the early 1990s has been of particular relevance and c...

Comparison of the Dimensions of Executive Functions in Monolingual and Bilingual Children

Objective: This study aimed to compare the executive functioning between bilingual and monolingual children. Methods: We recruited a total of 200 children, all under 5 years old, who participated in a cross-sectional study. These participants were separated into two groups based on their enrollment in a second language program. Group one consisted of children enrolled in a second language prog...

Comparison of Neural Network Models, Vector Auto Regression (VAR), Bayesian Vector-Autoregressive (BVAR), Generalized Auto Regressive Conditional Heteroskedasticity (GARCH) Process and Time Series in Forecasting Inflation in ‎Iran‎

‎This paper has two aims. The first is forecasting inflation in Iran using Macroeconomic variables data in Iran (Inflation rate, liquidity, GDP, prices of imported goods and exchange rates) , and the second is comparing the performance of forecasting vector auto regression (VAR), Bayesian Vector-Autoregressive (BVAR), GARCH, time series and neural network models by which Iran's inflation is for...

A Contrastive Investigation of Intertextuality in Research Articles Authored by Iranian vs. English Writers in Applied Linguistics

Academic discourse enables others' voices in a text to be realized through conventionalized citational patterns. However, from amongst a variety of factors, one thing which may influence the way others' voices are textualized is writers' affiliations to different cultures. Following this assumption, the present contrastive study attempted to explore manifest intertextual constructions across th...



Publication date: 2012